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Communication protocol and method for operating a VoIP 
communication with additional data transmission 

Technical field: 

[01] The present invention is related to the field of the telecommunications, in 
particular in connection with Voice over Internet (VoIP) applications. The 
invention is based on a priority application PCT/EP03/091 18 which is hereby 
incorporated by reference. 

Background of the invention: 

[02] Nowadays, the real-time transport protocols using RTP channels as 
described in IETF RFC 1889, such as for example the protocols known as H.323, 
Session Initiation Protocol (SIP) or H.248/Megaco, only consider call-related 
information exchanges between terminating points. 

[03] Indeed, these protocols consider signaling between communications 
terminals only for call set up and evolution. Once the RTP communication 
channel is established, there is no possibility to transmit additional data, i.e. data 
other than the audio and/or video data forming the payload of the exchanged 
streams. 

[04] When considering the foregoing context, one must differentiate the two 
planes involved in such communications, namely the signaling plane and the user 
plane. 

[05] In the signaling plane, data is exchanges between signaling gateways using 
the afore mentioned protocols (H. 323, SIP, .. .) in order to configure the 
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communication or session and to make it evolve if wanted or needed (invite 
another user, cut the communication, . . .). 

[06] In the user plane, data is exchanged between media gateways over a RTP 
channel, the useful data being the payload. 

[07] Nowadays, no possibility is provided to allow transmission of additional 
media related data over said RTP channel, which for example could allow to 
modify the use of the media. 

[08] One must point out that the H.323 protocol provides a specific H.320 
channel for exchanging such additional data. Nevertheless, using a proper channel 
for such purposes seems excessive and too resources consuming. 
[09] Furthermore, it must also be noticed that current IP networks do not 
provide means for inband end-to-end signaling. 

Summary of the invention: 

[10] The main aim of the present invention is to fill the lack expressed before 
and to provide a simple solution allowing user-to-user additional data exchange 
during communications over IP networks. 

[11] Therefore, the present invention concerns first a communication protocol 
suitable for bi-directional VoIP communication with media streams including 
audio and/or video data, and based on a real-time transport protocol (RTP) as 
described in IETF RFC 1889, wherein packets mainly comprised of a header part 
and a payload part are exchanged between at least two users, thus forming a RTP 
channel, characterised in that at least one sub-channel is embedded within the 
RTP channel, said sub-channel being adapted to carry command, signaling and/or 
information data. 
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[12] Preferably, the header part of each packet comprises at least one extension 
bit in a predetermined place, whereby allowing to provide one or several 
additional fields in the header or in a header extension of said packets to carry 
said command, signaling and/or information data. 

[13] The present invention also encompasses a method for operating a bi- 
directional VoIP communication over an IP network, based on a real-time 
transport protocol (RTP) as described for example in IETF RFC 1889, wherein 
media streams including audio and/or video data are exchanged, over a RTP 
channel, between at least two users, in the form of packets mainly comprised of a 
header part and a payload, characterised in that additional command, signaling 
and/or information data is transmitted through at least one sub-channel embedded 
within the RTP channel and available in both transmission directions. 
[14] As indicated before, additional fields are provided in the header part or in 
a header extension of the transmitted packets, in particular by setting at least one 
extension bit. 

[15] Thus, bi-directional user-to-user exchanges of information during a normal 

or a multimedia communication is made possible by creating sub-channels in a 

RTP channel to carry other types of data than said multimedia data. 

[16] For example, several kinds of sub-channels are possible: 

[17] - command sub-channel 

[18] - signaling sub-channel 

[19] - information sub-channel. 

[20] Advantageously, the bi-directional VoIP communication is operated on the 
basis of the protocol as described before. 
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Brief Description of the Drawings 

[21] A preferred structure of an RTP packet incorporating the additional fields 
proposed by the invention, is schematically shown on the enclosed figure. 

Detailed Description of the Invention 

[22] Thus, as can be seen from said figure, providing additional field(s) to carry 
said additional signaling data for transmission between users consists, in relation 
to the IETF RFC 1899 protocol features and for each transmitted packet, in setting 
the marker bit M and the extension bit X, in coding the payload type bits PT with 
the information of the user to user signals and in providing a header extension 
following the normal RTP header and comprising a profile indication field, a 
length indication field, a signaling type indication field and several bytes for 
receiving the additional data to be carried, the number of bytes corresponding to 
the value of the content of the length indication field. 
[23] When carrying out the method of the invention, the communication 
terminal of the user on the reception side analyses, upon receipt of a RTP packet, 
the header, in particular at least one extension bit, of the received packet and takes 
into account the command, signaling and/or information data contained in the 
additional fields of the said header or header extension. 

[24] The invention will now be described further on in a non limitative way and 
in relation to specific examples. 

[25] As exposed before, the main purpose of the invention is to introduce some 
additional information in media streams carried over a RTP channel. During bi- 
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directional VoIP communication, media streams including either audio or video 
information or both are sent through a RTP channel fully described in the IETF 
RFC 1889 document. 

[26] Current RTP channels convey only media streams with few additional 
information like timestamps and sequence numbers. 

[27] The present invention proposes a new scheme of this RTP channel 
basically built on its current description by adding signaling messages on both 
paths of the bi-directional communication. 

[28] To add information in the RTP channel, it is necessary to extend the field 
of information to be carried. This is done by adding a new element in the header 
indicated by an extension bit. This extension bit enables to create new fields in the 
header to multiply considerably the possibilities of the RTP channel. 
[29] At reception of such a RTP packet, the receiving terminal analyses the 
header and reacts accordingly in taking into account the information contained in 
the sub-channel. 

[30] By way of the two following examples of two possible scenarii of 
communications between two users A and B, the practical advantages of the 
invention will become apparent. 

[31] As a first example, a VoIP call between A and B is considered: 

[32] - A and B are talking and A asks B to spell a particular word, 

[33] - B could choose either to spell it vocally or to write it on the keyboard of 

this own terminal then to send to A, via an information sub-channel, by activating 

a command on his terminal, 
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[34] - at reception, the terminal of user A analyses the header and then display 
the information on the screen/display. 

[35] As a second example, a multimedia communication (voice + video) 
between A and B is considered: 

[36] - A and B are talking and use each their own motorized camera to show 
document and/or objects, 

[37] - A presents an object in the field of the camera connected to his terminal, 
[38] - B activates on his own terminal the joystick for zooming and/or 
modifying the angle of view, 

[39] - the commands are transmitted via the existing RTP channel (either voice 
or video channel) from terminal B to terminal/camera A on an embedded sub- 
channel, 

[40] - at reception, the terminal of user A analyses the header and then forwards 
the command to the camera. 

[41] The present invention is, of course, not limited to the preferred 
embodiment described and represented herein, changes can be made or 
equivalents used without departing from the scope of the invention. 



